Rule induction in data mining: effect of ordinal scales

نویسندگان

  • Helen M. Moshkovich
  • Alexander I. Mechitov
  • David L. Olson
چکیده

Many classi®cation tasks can be viewed as ordinal. Use of numeric information usually provides possibilities for more powerful analysis than ordinal data. On the other hand, ordinal data allows more powerful analysis when compared to nominal data. It is therefore important not to overlook knowledge about ordinal dependencies in data sets used in data mining. This paper investigates data mining support available from ordinal data. The effect of considering ordinal dependencies in the data set on the overall results of constructing decision trees and induction rules is illustrated. The degree of improved prediction of ordinal over nominal data is demonstrated. When data was very representative and consistent, use of ordinal information reduced the number of ®nal rules with a lower error rate. Data treatment alternatives are presented to deal with data sets having greater imperfections. q 2002 Elsevier Science Ltd. All rights reserved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Specification of Healthcare Expert Systems Using a Multi-mechanism Rule-extraction Pipeline

The application of knowledge extraction methodologies in support of medical informatics promises interesting developments that could potentially improve many aspects of healthcare services. In this paper we outline a multi-stage rule extraction pipeline for rule-based knowledge discovery. The featured methodology would facilitate operationally straightforward extraction of symbolic rules from m...

متن کامل

Quantization of Continuous Data for Pattern Based Rule Extraction

A great deal of interesting real-world data is encountered through the analysis of continuous variables, however many of the robust tools for rule discovery and data characterization depend upon the underlying data existing in an ordinal, enumerable or discrete data domain. Tools that fall into this category include much of the current work in fuzzy logic and rough sets, as well as all forms of...

متن کامل

Applying Ordinal Association Rules for Cleansing Data With Missing Values

Cleansing data of errors is an important processing step particularly when integrating heterogeneous data sources. Dirty data files are prevalent in data warehouses because of incorrect or missing data values, inconsistent attribute naming conventions or incomplete information. This paper improves the data cleansing ordinal association rules technique by proposing a solution for the missing val...

متن کامل

A survey of Bayesian Data Mining - Part I: Discrete and semi-discrete Data Matrices

This tutorial summarises the use of Bayesian analysis and Bayes factors for nding signi cant properties of discrete (categorical and ordinal) data. It overviews methods for nding dependencies and graphical models, latent variables, robust decision trees and association rules.

متن کامل

Mining Association Rules in Spatio-Temporal Data

This research demonstrates the application of association rule mining to spatiotemporal data. Association rule mining seeks to discover associations among transactions encoded in a database. An association rule takes the form A ? B where A (the antecedent) and B (the consequent) are sets of predicates. A spatiotemporal association rule occurs when there is a spatio-temporal relationship in the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Expert Syst. Appl.

دوره 22  شماره 

صفحات  -

تاریخ انتشار 2002